Assigning the Correct Word Class to Punjabi Unknown Words using CRF

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word Class Prediction of Ambiguous and Unknown Words of Punjabi Language Using Bi-gram Methods

Ambiguous and unknown words are found in every language. Ambiguous words are the words having different meaning in different sentences depending upon the context of the sentence. Assigning the correct word class to these ambiguous words is the fundamental task in almost all the NLP applications. A lot of work has been done on this and a lot of work is still to be done. Many statistical and rule...

متن کامل

Using Unknown Word Techniques to Learn Known Words

Unknown words are a hindrance to the performance of hand-crafted computational grammars of natural language. However, words with incomplete and incorrect lexical entries pose an even bigger problem because they can be the cause of a parsing failure despite being listed in the lexicon of the grammar. Such lexical entries are hard to detect and even harder to correct. We employ an error miner to ...

متن کامل

Guessing the Correct Inflectional Paradigm of Unknown Croatian Words

A real-life morphological analyzer must be able to handle properly the out-of-vocabulary words. We address the task of guessing the correct inflectional paradigm of unknown Croatian words. We frame this as a supervised machine learning problem: we train a model for deciding whether a candidate lemma-paradigm pair is correct based on a number of stringand corpus-based features. Our aim is to exa...

متن کامل

To Find the Pos Tag of Unknown Words in Punjabi Language

The accuracy of unknown words in the task of Part of Speech tagging is one significant area where there is still room for improvement. Because of their high information content, unknown words are also disproportionately important for how often they occur, and increase in number when experimenting with corpora from different domains. One area however, where all POS tagging methods suffer a signi...

متن کامل

Pruning False Unknown Words to Improve Chinese Word Segmentation

During the process of unknown word detection in Chinese word segmentation, many detected word candidates are invalid. These false unknown word candidates deteriorate the overall segmentation accuracy, as it will affect the segmentation accuracy of known words. Therefore, we propose to eliminate as many invalid word candidates as possible by a pruning process. Our experiments show that by cuttin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computer Applications

سال: 2016

ISSN: 0975-8887

DOI: 10.5120/ijca2016909684